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METHODS FOR IDENTIFYING MODULATORS OF 
QUADRUPLEX NUCLEIC ACIDS 

Field of the Invention 

[0001] The invention concerns methods for identifying molecules that modulate a 
biological activity of a nucleic acid capable of forming secondary structures such as G- 
quadruplexes. 

Background 

[0002] Developments in molecular biology have led to an understanding of how certain 
therapeutic compounds interact with molecular targets and lead to a modified physiological 
condition. Specificity of therapeutic compounds for their targets is derived in part from 
interactions between complementary structural elements in the target molecule' and the 
therapeutic compound. A greater variety of target structural elements in the target leads to the 
possibility of unique and specific target/compound interactions. Because polypeptides are 
structurally diverse, researchers have focused on this class of targets for the design of specific 
therapeutic molecules. 

[0003] In addition to therapeutic compounds that target polypeptides, researchers also have 
identified compounds that target DNA. Some of these compounds are effective anticancer 
agents and have led to significant increases in the survival of cancer patients. Unfortunately, 
however, these DNA targeting compounds do not act specifically on cancer cells and therefore 
are extremely toxic. Their unspecific action may be due to the fact that DNA often requires the 
uniformity of Watson-Crick duplex structures for compactly storing information within the 
human genome. This uniformity of DNA structure does not offer a structurally diverse 
population of DNA molecules that can be specifically targeted. 

[0004] Nevertheless, there are some exceptions to this structural uniformity, as certain 
DNA sequences can form unique secondary structures. For example, intermittent runs of 
guanines can form G-quadmplex structures, andrcomplementary runs of cytosines can form 
i-motif structures. Formation of G-quadruplex and i-motif structures occurs when a particular 
region of duplex DNA transitions from Watson-Crick base pairing to intermolecular and 
intramolecular single-stranded structures. 
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Summary 

[0005] Certain regulatory regions in duplex DNA can transition into single-stranded G- 
quadruplex structures that regulate important biological processes, A gene in proximity to a G- 
quadruplex structure often is not appreciably transcribed into RNA, and certain proteins induce 
transcription and activation of the gene by facilitating the transition of quadruplex structures 
into transcribable structures. G-quadruplexes in intramolecular parallel (IP) conformations 
now have been identified as transcription regulators in certain nucleic acids. This discovery 
thereby allows for the design of candidate molecules capable of modulating the biological 
activity of nucleic acids. 

[0006] There are multiple intramolecular parallel conformations, examples of which are 
shown in Figures 1 and 2. As shown in these drawings, quadruplex structures formed by the 
CMYC NHE region (nucleotide sequence shown in Table 1) include tetrads joined by loops, 
and the orientation of each strand contributing to the tetrads are predominantly parallel. In the 
conformations depicted in Figure 1, the four strands forming the tetrads are purely parallel (all 
four strands have a 5' to 3' orientation in the same direction). In contrast, the G-quadruplex 
conformations depicted in Figure 3 are purely antiparallel as the strand polarity sequentially 
alternates {i.e., the orientation at positions 7 to 9 is South, the orientation at positions 1 1 to 13 
or 12 to 14 is North, the orientation at positions 16 to 18 is South, and the orientation at 
positions 20 to 22 or 21 to 23 is North). The antiparallel conformations shown in Figure 3 
conform to a folded-over hairpin, which often is referred to as a "chair" conformation. Like in 
Figure 1, intramolecular parallel structures are shown Figure 2. Three strands contributing to 
the tetrads of the two G-quadruplexes depicted in the middle of the drawing are parallel and 
one is antiparallel. The G-quadruplex depicted at the top right of Figure 2 also is 
intramolecular parallel, in which two parallel strands non-sequentially alternate with strands of 
reverse polarity (i.e. 9 the orientations at positions 12 to 14 and at positions 16 to 18 are the 
same). Data suggests that the three IP G-quadruplex conformations with mixed strand polarity 
in Figure 2 (i.e., two depicted in the middle and one depicted at the top right) are particularly 
relevant structures for modulating the biological activity of certain nucleic acids by interaction 
with quadruplex-stabilizing compounds. Accordingly, the methods described herein are 
especially applicable to these three quadruplex conformations. 

[0007] Thus, featured herein is a method for identifying a candidate molecule that interacts 
with a nucleic acid capable of forming a G-quadruplex structure, which comprises contacting a 
test molecule with a nucleic acid comprising or consisting of a G-quadruplex in an IP 
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conformation, and determining whether a portion of the test molecule is in proximity to and/or 
interacts with one or more sites in the G-quadruplex. Sites with which the test molecule 
interact often are proximally oriented G-tetrads, and sometimes are within one or more bases in 
a loop region. Multi-functional test molecules sometimes interact with other portions of the 
nucleic acid, such as with a site in the G-quadruplex and a site in a secondary structure formed 
by an i-motif sequence (i.e., a sequence complementary to the G-quadruplex forming 
sequence) and/or a duplex region located between G-quadruplex and i-motif structures. A test 
molecule having a portion in proximity to the site in the G-quadruplex is identified as a 
candidate molecule that modulates the biological activity of the nucleic acid. The method also 
is useful for identifying a candidate molecule that modulates a biological activity of the nucleic 
acid capable of forming a G-quadruplex structure. 

[0008] Also featured are methods for determining whether a candidate molecule modulates 
a biological activity of the nucleic acid comprising an IP quadruplex. In certain embodiments, 
an assay useful for monitoring the IP quadruplex structure is utilized (e.g., chemical 
footprinting and/or circular dichroism assays). In specific embodiments, one or more 
nucleotides in the IP quadruplex are substituted with a fluorescent molecule (e.g., 2-amino 
adenosine) and structurally distinct forms of the labeled nucleic acid are detected under various 
conditions (e.g., by a gel electrophoresis mobility shift assay), or interactions between the 
labeled nucleic acid and other molecules are detected (e.g., by monitoring shifts in 
fluorescence when test molecules and other molecules are added to a sample containing the 
labeled nucleic acid). 

[0009] In addition, provided are methods for stabilizing an IP G-quadruplex conformation 
in a nucleic acid, which comprises contacting a system with a quadruplex-interacting molecule, 
whereby the molecule stabilizes the nucleic acid in an IP G-quadruplex conformation. The 
quadruplex interacting molecule sometimes selectively stabilizes an DP conformation over a 
non-IP quadruplex conformation, and sometimes selectively stabilizes one IP conformation 
over another IP conformation. In certain embodiments, the quadruplex interacting molecule is 
one set forth in U.S. application no. 10/661,241 filed September 12, 2003, and in other 
embodiments, the molecule is TMPyP4, an expanded porphoryin analog such as Se2SAP, or 
telomestatin, or a derivative thereof. Examples of other quadruplex-interacting molecules are 
described hereafter. Such processes can be performed in a cell-free system or in a system 
having cells (e.g., in a cell culture dish or in an animal or human). Also featured are methods 
for treating a condition by administering a candidate molecule to a subject in need thereof. 
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Brief Description of the Drawings 

[0010] Figure 1 depicts in the top row G-quadruplex IP structural conformations in a 
nucleic acid from the CMYC NHE region, which are useful for identifying candidate molecules 
that modulate the biological activity of the nucleic acid. All four strands contributing to the G- 
tetrad core have parallel polarities. The IP conformations are distinguished by different 
nucleosides in loop regions (e.g., the thymidine at position 10 and the guanosine at position 1 1 
in the third structure from the left) and tetrad regions (e.g., the guanosine at position 13 in the 
second structure from the left and the guanosine at position 14 in the third structure from the 
left). 

[0011] Figure 2 shows G-quadruplex structural conformations in a nucleic acid from the 
CMYC NHE region stabilized by quadruplex-interacting molecules. The conformation 
designated "parallel" on the left side of the drawing is an IP structure depicted in Figure 1 . 
The conformations depicted in the middle top and bottom of the drawing are IP conformations , 
with three parallel strands and one antiparallel strand. The conformations in the middle top 
and bottom are distinguished by the orientation of particular strands forming the tetrad core, as 
well as the orientation of loops containing the thymidine at position 10 and the thymidine at 
position 19. The conformation depicted on the right side of the drawing is intramolecular 
parallel, in which two parallel strands non-sequentially alternate with strands of reverse 
polarity. 

[0012] Figure 3 depicts intramolecular antiparallel conformations that may form upon 
addition of a quadruplex interacting molecule. The antiparallel conformations shown form a 
folded-over hairpin structure referred to herein as a "chair." 

Detailed Description 

[0013] Featured herein are methods for identifying a candidate molecule that interacts with 
a nucleic acid capable of forming a G-quadruplex IP structure. Such methods are useful for 
identifying molecules that modulate the biological activity of the nucleic acid by narrowing the 
number of molecules screened in biological assays. Such methods also are useful for 
designing molecules that modulate a biological activity of the nucleic acid. The compounds 
identified by the methods described herein are useful for identifying molecular therapeutics 
that reduce cell proliferation and/or microbial (e.g., bacterial or viral) proliferation, and in 
certain embodiments, treat a condition associated with the G-quadruplex in a subject. 
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Nucleic Acids 

[0014] Candidate molecules are identified by contacting a test molecule with a nucleic acid 
and determining whether the test molecule interacts with the nucleic acid and/or modulates a 
biological activity of the nucleic acid. Nucleic acids often comprise or consist of DNA {e.g., 
genomic DNA (gDNA) or complementary DNA (cDNA)) or RNA (e.g., mRNA, tRNA, and 
rRNA). In embodiments where a nucleic acid is a gDNA or cDNA fragment, the fragment 
often is 50 or fewer, 100 or fewer, or 200 or fewer base pairs in length, and sometimes is about 
300, about 400, about 500, about 600, about 700, about 800, about 900, about 1000, about 1 100, 
about 1200, about 1300, or about 1400 base pairs in length. In an embodiment, the nucleic acid is 
i double-stranded, and is sometimes between about 30 nucleotides to about 40 nucleotides in 
length. Methods for generating gDNA and cDNA fragments are known in the art (e.g., gDNA 
may be fragmented by shearing methods and cDNA fragment libraries are commercially 
available). In embodiments where the nucleic acid is a synthetically prepared fragment nucleic 
acid, often referred to as an "oligonucleotide," the fragment sometimes is less than 30, less than 
40, less than 50, less than 60, less than 70, less than 80, less than 90, or less than 100 
nucleotides in length. Synthetic oligonucleotides can be synthesized using standard methods 
and equipment, such as by using an ABI™3900 High Throughput DNA Synthesizer, which is 
available from Applied Biosystems (Foster City, CA). 

[0015] Nucleic acids sometimes comprise or consist of analog or derivative nucleic acids, 
such as peptide nucleic acids (PNA) and others exemplified in U.S. Patent Nos. 4,469,863; 
5,536,821; 5,541,306; 5,637,683; 5,637,684; 5,700,922; 5,717,083; 5,719,262; 5,739,308; 
5,773,601; 5,886,165; 5,929,226; 5,977,296; 6,140,482; WIPO publications WO 00/56746 and 
WO 01/14398, and related publications. Methods for synthesizing oligonucleotides 
comprising such analogs or derivatives are disclosed, for example, in the patent publications 
cited above, in U.S. Patent Nos. 5,614,622; 5,739,314; 5,955,599; 5,962,674; 6,117,992; and in 
WO 00/75372. 

[0016] The nucleic acids often include a nucleotide sequence capable of forming a 
secondary structure. Examples of secondary structures are quadruplex structures, which form 
from subsequences rich in purines (e.g., guanines in G-quadruplex structures), and i-motif 
structures, which form from subsequences rich in pyrimidines (e.g., cytosines). Secondary 
structures can exist in different conformations, which differ in strand stoichiometry and/or 
strand orientation. For example, secondary structures sometimes are formed by interstrand 
interactions, in which the interacting strands are in the same direction (i.e., parallel, the 
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interacting strands are oriented 5' to 3') or in different directions (i.e. 9 antiparallel, the 
interacting strands are oriented 5' to 3' and 3' to 5'), and sometimes are formed by intrastrand 
interactions, examples of the latter of which are described in Figure 1. Quadruplex structures 
sometimes form because certain purine rich strands are capable of engaging in a slow 
equilibrium between a typical duplex helix structure and both unwound and non-B-form 
substructures. These unwound and non-B forms sometimes are referred to as "paranemic 
structures," and some forms are associated with sensitivity to SI nuclease digestion, which 
sometimes are referred to as "nuclease hypersensitivity elements" or <e NHEs." A quadruplex is 
one type of paranemic structure and certain NHEs can adopt a quadruplex structure. The entire 
length of the nucleic acid sometimes participates in the quadruplex structure, and a portion of 
the nucleic acid length {i.e., a subsequence) often forms a quadruplex structure. 

[0017] The ability of guanine-rich nucleic acids of adopting G-quadruplex conformations 
is due to the formation of guanine tetrads through Hoogsteen hydrogen bonds. One nucleic 
acid sequence can give rise to different quadruplex orientations, where the different 
conformations depend upon conditions under which they form, such as the concentration of 
potassium ions present in the system and the time that the quadruplex is allowed to form. 
Different quadruplex conformations can be distinguished from one another using standard 
procedures such as chemical footprinting and circular dichroism analyses (see e.g., U.S. 
application no. 10/407,449 filed April 4, 2003). Also, multiple conformations can be in 
equilibrium with one another, and can be in equilibrium with a duplex conformation if a 
complementary strand exists in the system. For example, the IP conformations described 
herein may be in equilibrium with a chair conformation (i.e. 9 the latter conformation having 
bridging loops running orthogonal to two parallel loops and resulting from the simple folding- 
over of a DNA G-hairpin) or a basket conformation. The equilibrium may be shifted to favor 
one conformation over another such that the favored conformation is present in a higher 
concentration or fraction over the other conformation or other conformations. A certain 
conformation also may be trapped, for example, by selectively binding one conformation over 
others by a compound that stabilizes the particular conformation. The terms "favor" and "trap" 
as used herein refer to one conformation being at a higher concentration or fraction relative to 
other conformations, and also refer to stabilizing the particular quadruplex conformation. The 
terms "hinder" or "non-trapped" as used herein refer to one conformation being at a lower 
concentration with respect to other conformations. One conformation may be favored or 
trapped over another conformation if it is present in the system at a fraction of 50% or greater, 



6 



WO 2005/049867 



PCT/US2004/037789 



75% or greater, or 80% or greater or 90% or greater with respect to another conformation (e.g., 
another quadruplex conformation, another paranemic conformation, or a duplex conformation). 
Conversely, one conformation may be hindered or not trapped if it is present in the system at a 
fraction of 50% or less, 25% or less, or 20% or less and 10% or less, with respect to another 
conformation. 

[0018] Equilibrium can be shifted to favor one quadruplex form over another by employing 
a variety of methods. For example, certain bases in a quadruplex nucleic acid may be mutated 
to prevent the formation of one conformation. Typically, these mutations are located in tetrad 
regions of the quadruplex (i.e., regions in which four bases interact with one another in a 
planar orientation). Also, ion concentrations and the time with which a quadruplex nucleic 
acid is contacted with certain ions can favor one conformation over another. For example, 
potassium ions stabilize quadruplex structures, and higher concentrations of potassium ions 
and longer contact times of potassium ions with a quadruplex nucleic acid can favor one 
conformation over another. A particular quadruplex conformation, such as IP and chair 
confoimations, can be favored with contact times of 5 minutes or less in solutions containing 
100 mM potassium ions, and often 10 minutes or less, 20 minutes or less, 30 minutes or less, 
and 40 minutes or less. Basket conformations typically require longer contact times with 
potassium ions. Potassium ion concentration and the counter anion can vary, and the specific 
quadruplex conformations existing for a given set of conditions can be determined. 
Furthermore, different quadruplex structures may be distinguished, trapped and favored by 
probing them with molecules that favorably interact with one quadruplex form over another 
(e.g., TMPyP4 binds with a higher affinity to chair structures as opposed to basket and IP 
structures). Quadruplex-interacting compounds sometimes bind with higher affinity to 
particular quadruplex structures in vitro than in vivo (e.g., the compound telomestatin binds to 
IP and chair structures in vitro but more favorably to chair structures in vivo). 

[0019] Particular nucleotide sequences in a nucleic acid sometimes direct the type of 
secondary structure or structures that the nucleic acid is capable of adopting. For example, 
nucleic acid sequences conforming to the motif (G a X b )cG a sometimes form an intramolecular 
IP or antiparallel G-quadruplex structure. Sometimes a is an integer between 2 and 6 and b is 
an integer between 1 and 4, and often, b is the integer 2 or 3. A nucleic acid often includes one 
or more flanking nucleotides on the 5' and/or 3' end of the nucleotide sequence that forms the 
quadruplex and are not part of the quadruplex structure. These motifs can be used to identify 
other quadruplex-forming sequences in regions of a genome operably linked to a gene. G- 
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quadruplexes formed by sequences conforming to this motif sometimes include 2 to 6 G- 
tetrads, and often include between 3 and 5 G-tetrads. 

[0020] Sometimes, a nucleic acid capable of forming one or more secondary structures 
includes a nucleotide sequence identical to a native nucleotide sequence present in genomic 
DNA. For example, a nucleic acid often comprises or sometimes consists of a nucleotide 
sequence or a portion of a nucleotide sequence set forth in Table 1 . The nucleotide sequences 
in Table 1 originate from regions in genomic DNA that are capable of forming a quadruplex 
structure, which can regulate transcription of the CMYC, PDGFA, PDGFB/c-sis, c-ABL, RET, 
BCL-2, Cyclm Dl/BCL-1, KRAS, HRASasxd VEGF. 



TABLE 1 



Sequence 


SEQ 
ID NO 


Origin 


TG4AG3TG4AG3TG4AAGG 


1 




Gi 3 CG 5 CG 5 CG 5 AG4T 


2 


PDGFA 


G8ACGCG3AGCTG5AG3CTTG4CCAG3CG4CGCTTAG5 


3 


PDGFB/c-sis 


AGGAAG4AG3CCG6AGGTGGC 


4 


c-ABL 


G 5 (CG 4 ) 3 


5 


RET 


G3AGGAAG5CG3AGTCG4 


6 


BCL-2 


G4ACGCG3CG5CG6AG3CG 


7 


Cyclin Dl/BCL-1 


(G 3 A)3AGGA(G 3 A)4GC 


.8 


K-RAS 


G 5 (CG 4 ) 3 


9 


H-RAS 


C3G4CG3C2G5CG4TC3G2CG5CG2AG 


10 


VEGF 



While quadruplex forming sequences typically are identified in regulatory regions upstream of 
a gene (e.g., a promoter or a 5' untranslated region (UTR)), quadruplex forming sequences also 
may be identified within a 3' UTR or within an intron or exon of a gene. 

[0021] A nucleic acid sometimes includes a nucleotide sequence similar to or substantially 
identical to a native nucleotide sequence. A similar or substantially identical nucleotide 
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sequence may include modifications to the native sequence, such as substitutions, deletions, or 
insertions of one or more nucleotides. The substantially identical sequence sometimes 
conforms to the (G a X b ) c G a motif described above. The term "substantially identical" refers to 
two or more nucleic acids sharing one or more identical nucleotide sequences. Included are 
nucleotide sequences that sometimes are 55%, 60%, 65%, 70%, 75%, 80%, or 85% identical to 
a native quadruplex-forming nucleotide sequence, and often are 90% or 95% identical to the 
native quadruplex-forming nucleotide sequence (each identity percentage can include a 1%, 
2%, 3% or 4% variance). One test for determining whether two nucleic acids are substantially 
identical is to determine the percentage of identical nucleotide sequences shared between the 
nucleic acids. 

[0022] Calculations of sequence identity can be performed as follows. Sequences are 
aligned for optimal comparison purposes and gaps can be introduced in one or both of a first 
and a second nucleic acid sequence for optimal alignment. Also, non-homologous sequences 
can be disregarded for comparison purposes. The length of a reference sequence aligned for 
comparison purposes sometimes is 30% or more, 40% or more, 50% or more, ofl;en 60% or 
more, and more often 70%, 80%, 90%, 100% of the length of the reference sequence. The 
nucleotides at corresponding nucleo;tide positions then are compared among the two sequences. 
When a position in the first sequence is occupied by the same nucleotide as the corresponding 
position in the second sequence, the nucleotides are deemed to be identical at that position. 
The percent identity between the two sequences is a function of the number of identical 
positions shared by the sequences, taking into account the number of gaps, and the length of 
each gap, introduced for optimal alignment of the two sequences. 

[0023] Comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. Percent identity between two 
nucleotide sequences can be determined using the algorithm of Meyers & Miller, CABIOS 
4: 1 1-17 (1989), which has been incorporated into the ALIGN program (version 2.0), using a 
PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. Percent 
identity between two nucleotide sequences can be determined using the GAP program in the 
GCG software package (available at http address www.gcg.com), using a NWSgapdna.CMP 
matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A set 
of parameters often used is a Blossum 62 scoring matrix with a gap open penalty of 12, a gap 
extend penalty of 4, and a frame shift gap penalty of 5. 
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[0024] Another manner for determining if two nucleic acids are substantially identical is to 
assess whether a polynucleotide homologous to one nucleic acid will hybridize to the other 
nucleic acid under stringent conditions. As use herein, the term "stringent conditions" refers to 
conditions for hybridization and washing. Stringent conditions are known to those skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 
6.3.1-6.3.6 (1989). Aqueous and.non-aqueous methods are described in that reference and 
either can be used. An example of stringent conditions is hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 
0.1% SDS at 50°C. Another example of stringent conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 
0.1% SDS at 55°C. A further example of stringent conditions is hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 
0.1% SDS at 60°C. Often, stringent conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 
0.1% SDS at 65°C. Also, stringency conditions include hybridization in 0.5M sodium 
phosphate, 7% SDS at 65°C, followed by one or more washes at 0.2X SSC, 1% SDS at 65°C. 

[0025] Also, nucleotide sequences of native quadruplex-forming nucleotide sequences may 
be used as "query sequences" to perform a search against public databases to identify related 
sequences. Such searches can be performed using the NBLAST and XBLAST programs 
(version 2.0) of Altschul, et al, J. Mol Biol 215:403-410 (1990). BLAST nucleotide searches 
can be performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
homologous nucleotide sequences. To obtain gapped alignments for comparison purposes, 
Gapped BLAST can be utilized as described in Altschul, et al, Nucleic Acids Res. 
25(17):3389-3402 (1997). When utilizing BLAST and Gapped BLAST programs, default 
parameters of the respective programs (e.g., XBLAST and NBLAST) can be used (see, http 
address www.ncbi.nlm.nih.gov). 

Contacting Nucleic Acids with Test Molecules 
[0026] The nucleic acid is contacted with a test molecule, the latter of which is described in 
further detail hereafter, in a variety of manners. In certain embodiments, a physical 
representation of the nucleic acid and the test molecule is contacted and interactions between 
the physical representations are observed. In other embodiments, a representation of the 
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nucleic acid is contacted with a representation of a test molecule on paper or in silico (i.e., in a 
computer environment). 

[0027] In in silico processes, three-dimensional representations of the nucleic acid and the 
test molecule often are brought into proximity with one another. The three dimensional 
representations of the nucleic acid and the test molecule are oriented with one another to 
maximize favorable interactions (e.g., ionic interactions, hydrogen bonds and/or hydrophobic 
interactions) and minimize unfavorable interactions (e.g., steric clashes) between portions of 
the test molecule and portions of the nucleic acid, which minimizes the energy of the complex. 
The contacting process sometimes is performed manually (e.g., the orientation of three- 
dimensional representations of the nucleic acid and the test molecule are manipulated on a 
computer by a user), sometimes is accomplished in an automated manner using computer 
software (e.g., the software docks the three-dimensional representations of the nucleic acid and 
the test molecule and then energy minimizes the resulting structures), and often is 
accomplished by a combination of a manual and automated method. Commercially available 
computer software is available for carrying out these processes (see, e.g., AutoDock, DOCK, 
Dock/Insightll, FlexX, MULTIDOCK, FTDOCK and GRAMM computer programs; see also 
http address www.molvis.chem.indiana.edu/ app_guide/ for other computer programs useful 
for visualizing three dimensional representations of molecules and manipulating them). Using 
such software, the test molecule sometimes is part of a molecular library from which a 
computer program often samples test molecules iteratively and docks them with a nucleic acid. 
In other embodiments, a computer program is utilized to build test molecules having portions 
that meet specific criteria. In an example of the latter embodiments, a test molecule can be 
built by specifying that a hydrophobic moiety must be in proximity to a duplex region located 
between a G-quadruplex structure and an i-motif and also specify that a moiety capable of 
hydrogen bonding with nucleotides in a G-quadruplex structure be present and specifically 
oriented in the test molecule. 

[0028] The nucleic acid sometimes is contacted with a nucleic acid binding protein in 
silico by docking three-dimensional renditions of the two molecules to one another. 
Three-dimensional renditions of test molecules sometimes are contacted with the three- 
dimensional rendition of the docked nucleic acid/protein structure and sometimes the test 
molecule or candidate molecule is contacted with the undocked nucleic acid and/or binding 
protein before the nucleic acid and protein are docked to one another. In an embodiment, the 
nucleic acid comprises or consists of a structure set forth in Figure 1 . In another embodiment, 
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the nucleic acid is contacted with one or more nucleic acid binding protein, such as NM23-H2 
. (see e.g., deposit reference 1NUE at http address rutgers.rcsb.org/pdb/ for coordinates of a 
three-dimensional representation of an NM23-H2 protein) and/or another nucleic acid binding 
protein (e.g., CNBP, hnRNPic or Spl). In certain embodiments, the nucleic acid is docked with 
a portion of a nucleic acid binding protein before, during, or after the nucleic acid and/or the 
nucleic acid binding protein portion are contacted with a test molecule or candidate molecule. 

[0029] Based in part upon the proximity of specific atoms in the test molecule and in the 
nucleic acid (e.g., distance between the atoms), it can be determined whether one or more 
atoms in the test molecule and in the nucleic acid interact with one another. Atoms in the two 
molecules can interact with one another to form hydrogen bond interactions, Van der Waals 
interactions, ionic interactions, and hydrophobic interactions, for example. A portion of a test 
molecule often is in proximity to a site in the nucleic acid molecule when the closest atoms in 
the two molecules are within a distance of about 8 A from one another, about 5 A from one 
another, about 4 A from one another, about 3 A from one another, about 2 A from one another, 
about 1 A from one another, or about 0.5 A from one another. The molecules sometimes are 
determined as being in proximity to one another based upon a relative distance, where the 
distance between an atom- in the test molecule and an atom at a site of the nucleic acid is closer 
to one another than the distance between other test molecule atoms and nucleotide atoms. 

[0030] Candidate molecules are selected from test molecules meeting certain criteria. In 
an embodiment, one criterion is that a portion of the test molecule is in proximity to a site and 
optionally interacts with a site comprising one or more atoms of a nucleoside in a G-tetrad of 
the G-quadruplex IP structure. For example, a portion of the test molecule may be in 
proximity and optionally interact with a portion of a guanosine in a tetrad, such as a tetrad 
formed by guanosines at positions 9, 13, 14, 18, 22, 23 or positions 7, 11, 12, 16, 20, 21 or a 
combination thereof in the IP quadruplex structures of Figure 1. A portion of the test molecule 
also may be in proximity to and optionally interact with one or more atoms in a nucleoside 
located in a loop region (e.g., an atom in a thymidine at position 10, a guanosine at position 11, 
a guanosine at position 14, an adenosine at position 15, a thymidine at position 19, a guanosine 
at position 20, or a combination thereof in the loop regions of the G-quadruplex IP structures in 
Figure 1), and sometimes, may be in proximity to or optionally interact with one or more atoms 
in a nucleoside not in a loop or tetrad (e.g., an adenosine at position 6 or a guanosine at 
position 23 in Figure 1). 
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[0031] Of the intramolecular parallel conformations having three parallel strands and one 
antiparallel strand, a portion of the test molecule may be in proximity to and optionally interact 
with one or more atoms of a tetrad guanosine, such as a guanosine in a tetrad at position 7, 13, 
18 or 22; 9, 13, 18 or 20; 9, 1 1, 16 or 20; or 7 ,1 1, 16 or 22 as shown in Figure 2 or the like. 
For the latter G-quadruplex conformations, a portion of the test molecule may be in proximity 
to and optionally interact with one or more atoms of a loop nucleoside, such as a thymidine at 
position 10, a guanosine at position 1 1, an adenosine at position 15, a thymidine at position 19, 
and a guanosine at position 20, and sometimes, may be in proximity to or optionally interact 
with one or more atoms in a nucleoside not in a loop or tetrad, such as an adenosine at position 
6. 

[0032] Of intramolecular G-quadruplex conformations having mixed strand polarity, such 
as the conformation shown on the right of Figure 2 and the like, a portion of the test molecular 
may in proximity to and optionally interact with one or more atoms of a tetrad guanosine, such 
as a guanosine in a tetrad at position 7, 14, 18 or 21 or position 9, 12, 16 or 23. A portion of 
the test molecule also may be in proximity to and optionally interact with one or more atoms in 
a nucleoside located in a loop region (e.g., an atom in a thymidine at position 10, a guanosine 
at position 1 1, an adenosine at position 15, a thymidine at position 19, a guanosine at position 
20, or a combination thereof), and sometimes, may be in proximity to or optionally interact 
with one or more atoms in a nucleoside not in a loop or tetrad (e.g., an adenosine at position 6). 

[0033] The methods described herein also are applicable to intramolecular antiparallel 
conformations, such as those shown in Figure 3 and the like. A portion of a test molecule may 
be in proximity to and optionally interact with one or more atoms in a tetrad guanosine, such as 
a guanosine in a tetrad at position 7, 13, 16 or 22; 7, 13, 16 or 23; 7, 14, 16 or 22; 7, 14, 16 or 
23; 9, 11, 18 or 20; 9, 11, 18 or 21; 9, 12, 18 or 20; or 9, 12, 18 or 21. A portion of the test 
molecule may be in proximity to and optionally interact with one or more atoms of a loop 
nucleoside in an intramolecular antiparallel structure, such as a thymidine at position 10, a 
guanosine at position 1 1, a guanosine at position 14, an adenosine at position 15, a thymidine 
at position 19 and/or a guanosine atposition 20,-and sometimes, may be in proximity to or 
optionally interact with one or more atoms in a nucleoside not in a loop or tetrad, such as an 
adenosine at position 6 or a guanosine at position 23. 

[0034] Nucleic acid subsequence having sufficient guanosines to form G-quadruplex 
tetrads can be aligned to form the conformations depicted in Figure 1 and Figure 2, even where 
the nucleotide subsequence differs from the CMYC NHE region subsequence and the 



13 



WO 2005/049867 



PCT/US2004/037789 



nucleosides and number of nucleosides not forming the tetrad core differ. For example, 
nucleotide subsequences having sufficient guanosines to form G-tetrads in a quadruplex, such 
as those shown in Table 1, can be assembled into the nucleic acid configurations shown in 
Figures 1 and 2. 

[0035] Test molecules that interact with more than one atom in a nucleoside participating 
in a G-tetrad or a loop often are identified as candidate molecules. In addition to interacting 
with atoms in a G-quadruplex, a candidate molecule sometimes interacts with one or more 
atoms in an adjacent secondary structure. An adjacent secondary structure sometimes is 
formed by an i-motif complementary to the nucleotide sequence that forms the G-quadruplex. 
In other embodiments, the adjacent secondary structure is formed from a nucleotide sequence 
located upstream or downstream of the same strand having the G-quadruplex-forming 
nucleotide sequence that interacts with the G-quadruplex (e.g., a silencer sequence in 
association with a G-quadruplex formed by a nucleotide sequence located 5' of the PDGF 
receptor open reading frame; see e.g., U.S. application no. 60/425,869 filed November 12, 
2002). The candidate molecule sometimes forms one or more hydrogen bonds and/or 
sometimes forms a hydrophobic interaction {e.g., end-stacks or intercalates) with one or more 
atoms in a G-quadruplex structure and/or an adjacent secondary structure. A portion of a 
candidate molecule sometimes forms a hydrogen bond and/or intercalates with one or more 
atoms in a duplex region flanking the G-quadruplex and/or the adjacent secondary structure. 

[0036] In addition to interactions with one IP quadruplex conformation, a candidate 
molecule sometimes interacts with another G-quadruplex conformation, such as another IP 
conformation or an intramolecular antiparallel conformation. In certain embodiments, a 
candidate molecule interacts with a nucleic acid when the latter is in an IP quadruplex 
conformation having four parallel strands contributing to the tetrad core and in an IP 
quadruplex conformation having two or three parallel strands contributing to the tetrad core. 
In another embodiment, the candidate molecule interacts with an IP quadruplex conformation 
having four parallel strands with a lower affinity than an IP quadruplex conformation having 
two or three parallel strands. In another embodiment, a candidate molecule identified as 
interacting with an IP quadruplex is tested to determine whether it can interact with a non-IP 
quadruplex form or structure (e.g., antiparallel, basket, intermolecular). 

[0037] Test molecules can be optimized to form specific interactions with one or more 
sites of the nucleic acid. For example, a test molecule can be designed or optimized to include 
a chemical moiety capable of forming an interaction with an epsilon-amino moiety from a 
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lysine in the nucleic acid binding protein (e.g., lysine 12 in a subunit of NM23-H2), thereby 
inhibiting relaxation of secondary structures in the nucleic acid, inhibiting transcription of the 
CMYC gene, and inhibiting or reducing cell proliferation. In the latter embodiment, such an 
inhibitor sometimes acts as a suicide inhibitor which traps the binding protein in an 
unproductive conformation. The test molecule also can be designed or optimized to include 
portions that interact with the G-quadruplex structure and/or an i-motif structure and optionally 
to include a moiety that intercalates with the intermediate duplex region or part of the duplex 
region. Intercalation moieties often are hydrophobic and examples of such moieties are known 
(e.g., ethidium bromide and other moieties described below). 

Test Molecules and Candidate Molecules 
[0038] The nucleic acid is contacted with one or more test molecules to identify candidate 
molecules that modulate a biological activity of the nucleic acid. Molecules often are organic 
or inorganic compounds having a molecular weight of 10,000 grams per mole or less, and 
sometimes having a molecular weight of 5,000 grams per mole or less, 1,000 grams per mole 
or less, or ,500 grams per mole or less. Also included are salts, esters, and other 
pharmaceutically acceptable forms of the compounds. Compounds that interact with nucleic 
acids are known in the art (see, e.g., Hurley, Nature Rev. Cancer 2:188-200 (2002); Anantha, 
et ah, Biochemistry Vol. 37, No. 9:2709-2714 (1998); and Ren, et al, Biochemistry 38:16067- 
16075 (1999)). 

[0039] Compounds can be obtained using known combinatorial library methods, including 
spatially addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; "one-bead one-compound" library methods; and synthetic library 
methods using affinity chromatography selection. Examples of methods for synthesizing 
molecular libraries are described, for example, in DeWitt, et ah, Proc. Natl. Acad. Set U.S.A. 
90:6909 (1993); Erb, et al., Proc. Natl. Acad. Sci. USA 91:1 1422 (1994); Zuckermann, et al, 
J. Med. Chem. 37:2678 (1994); Cho, et al, Science 261:1303 (1993); Carrell, et al, Angew. 
Chem. Int. Ed. Engl. 33:2059 (1994); Carell, etal, Angew. Chem. Int. Ed. Engl. 33:2061 
(1994); and Gallop, etal, J. Med. Chem. 37:1233 (1994). 

[0040] In addition to an organic and inorganic compound, a molecule sometimes is a 
nucleic acid, a catalytic nucleic acid (e.g., a ribozyme), an inhibitory RNA (RNAi or siRNA), a 
nucleotide, a nucleotide analog, a polypeptide, an antibody, or a peptide mimetic. Methods for 
making and using these molecules are known. For example, methods for making ribozymes 
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and assessing ribozyme activity are described (see e.g., U.S. Patent Nos. 5,093,246; 4,987,071; 
and 5,116,742; Haselhoff & Gerlach, Nature 334:585-591 (1988) and Bartel & Szostak, 
Science 261:1411-1418 (1993)). Also, methods for generating siRNA are known (see e.g., 
Elbashir, et al t . Methods 26:199-213 (2002) and http address www.dharmacon.com) and 
peptide mimetic libraries are described (see, e.g., Zuckermann, et al., J. Med. Chem. 
37:2678-2685 (1994)). 

[0041] Candidate molecules sometimes meet the following structural and chemical criteria. 
A candidate molecule sometimes interacts with one region of the nucleic acid or the nucleic 
acid binding protein, and sometimes interacts with two or more regions of the nucleic acid 
and/or the nucleic acid binding protein (e.g., the candidate molecule can interact with two or 
more regions in the nucleic acid or nucleic acid binding protein, or can interact with a region in 
the nucleic acid and a region in the nucleic acid binding protein). Each molecule sometimes 
includes a moiety that interacts with a G-quadruplex structure, an i-motif structure and/or a 
duplex structure of the nucleic acid, and/or an amino acid of the nucleic acid binding protein. 
In an embodiment, a moiety in the molecule is capable of end-stacking with or intercalating 
between one or more G-tetrads of a G-quadruplex, such as a moiety comprising a planar or 
polycyclic structure, for example. Examples of such moieties are anthraquinone, acridone, 
napthyl, pheoxazine, xanthone, benzoxazole, phenathiazine, phenazine, benzothiazole, 
acridine, dibenzofuran, benzimidazole, fluorenone, fluorene, and phenanthroline. In another 
embodiment, a moiety is a duplex DNA intercalated, capable of binding to a duplex DNA 
region adjacent to a secondary structure in the nucleic acid, such as a moiety having a planar or 
polycyclic structure (e.g., an intercalate* listed previously). In a related embodiment, the 
moiety is capable of groove-binding to a duplex region in the nucleic acid, such as a 
polypeptide or sugar-based moiety capable of groove binding. In other embodiments, a moiety 
is capable of binding to an amino acid of a nucleic acid binding protein (e.g., NM23 protein), 
such as a nucleotide or a nucleotide mimetic, or a carbonyl-, acetal-, or mine-containing 
moiety. Molecules having quadruplex-interacting moieties are disclosed in application no. 
10/407,449 filed April 4, 2003; application no. 10/660,897 filed September 1 1, 2003; 
application no. 10/661,241 filed September 12, 2003; application no. 60/463,171 filed April 
15, 2003 and application no. 60/461,205 filed April 7, 2003. 

[0042] Each molecule that interacts with two or more target regions in a nucleic acid 
and/or a nucleic acid binding protein sometimes comprises two or more moieties that 
independently interact with the target regions and are joined by a linker. A linker joining the 
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moieties sometimes is 7.5 A to 40 A in length, sometimes comprises between 5 and 20 atoms, 
often is flexible, and sometimes is constrained (e.g., in a conformation that follows the groove 
of duplex DNA). The linker sometimes comprises polyamide or polysaccharide moieties (e.g., 
comprising amino saccharide units), and sometimes includes known linkage functionalities 
such as those independently selected from amide, ester, ether, amine, sulfide, sulfonamide, 
alkyl or aryl, for example. 

[0043] Featured herein is structural information descriptive of the candidate molecules and 
therapeutics identified by the processes described herein as modulating the biological activity 
of a nucleic acid capable of adopting a G-quadruplex IP conformation. As described above, 
the candidate molecule or therapeutic may modulate the biological activity by interacting with 
G-quadruplexes in other conformations, such as the chair conformation for example. In certain 
embodiments, information descriptive of candidate molecule structure (e.g., chemical formula 
or sequence information) sometimes is stored and/or renditioned as an image or as three- 
dimensional coordinates. The information often is stored and/or renditioned in computer 
readable form and sometimes is stored and organized in a database. In certain embodiments, 
the information may be transferred from one location to another using a physical medium (e.g., 
paper) or a computer readable medium (e.g., optical and/or magnetic storage or transmission 
medium, floppy disk, hard disk, random access memory, computer processing unit, facsimile 
signal, satellite signal, transmission over an internet or transmission over the world-wide web). 

Biological Activity of Candidate Molecules 

[0044] Determining whether the biological activity of a native quadruplex DNA is 
modulated in a cell, tissue, or organism can be accomplished by monitoring modulation of a 
signal in an in vitro or in vivo assay. The signal in the assay sometimes is generated or 
modulated by transcription of the nucleic acid, binding of a protein to the nucleic acid (e.g., 
NM23-H2), a fluorophore incorporated in the nucleic acid, or cell proliferation, for example. 
Transcription can be detected, for example, by directly detecting RNA transcripts or detecting 
polypeptides translated by transcripts, which are methods known in the art. 

[0045] Candidate molecules, some identified by the procedures described herein, 
sometimes are screened in in vitro or in vivo assays to determine whether they modulate the 
biological activity of the nucleic acid. Candidate molecules and nucleic acids can be added to 
an assay system in any order to determine whether the candidate molecule modulates the 
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biological activity of the nucleic acid. For example, a candidate molecule sometimes is added 
to an assay system before, simultaneously, or after a nucleic acid is added. 

[0046] In these assays, candidate molecules are contacted with the nucleic acid in the 
assay system, where the term "contacting" refers to placing a candidate molecule in close 
proximity to a nucleic acid and allowing the assay components to collide with one another, 
often by diffusion. Contacting these assay components with one another can be accomplished 
by adding them to a body of fluid or in a reaction vessel, for example. The components in the 
system may be mixed in variety of manners, such as by oscillating a vessel, subjecting a vessel 
to a vortex generating apparatus, repeated mixing with a pipette or pipettes, or by passing fluid 
containing one assay component over a surface having another assay component immobilized 
thereon, for example. 

[0047] As used herein, the term "system" refers to an environment that receives the assay 
components, which includes, for example, microtiter plates {e.g., 96-well or 384-well plates), 
silicon chips having molecules immobilized thereon and optionally oriented in an array (see, 
e.g., U.S. Patent No. 6,261,776 and Fodor, Nature 364:555-556 (1993)), and microfluidic 
devices {see, e.g., U.S. Patent Nos. 6,440,722; 6,429,025; 6,379,974; and 6,316,781). The 
system can include attendant equipment for carrying out the assays, such as signal detectors, 
robotic platforms, and pipette dispensers. 

[0048] One or more assay components (e.g., the nucleic acid, candidate molecule or 
nucleic acid binding protein) sometimes are immobilized to a solid support. The attachment 
between an assay component and the solid support often is covalent and sometimes is non- 
covalent (see, e.g., U.S. Patent No. 6,022,688 for non-covalent attachments) and the 
attachment sometimes is by a linking moiety. The solid support often is one or more surfaces 
of the system, such as one or more surfaces in each well of a microtiter plate, a surface of a 
silicon wafer, a surface of a bead (see, e.g., Lam, Nature 354: 82-84 (1991)) optionally linked 
to another solid support, or a channel in a microfluidic device, for example. Types of solid 
supports, linker molecules for covalent and non-covalent attachments to solid supports, and 
methods for immobilizing nucleic acids and other molecules to solid supports are known (see, 
e.g., U.S. Patent Nos. 6,261,776; 5,900,481; 6,133,436; and 6,022,688; and WO publication 
WO 01/18234). 

[0049] Protein molecules sometime are contacted with the nucleic acid. Polypeptide 
molecules sometimes are added to the system in free form, and sometimes are linked to a solid 
support or another molecule. For example, polypeptide test molecules sometimes are linked to 
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a phage via a phage coat protein. The latter embodiment often is accomplished by using a 
phage display system, where nucleic acids linked to a solid support are contacted with phages 
that display different polypeptide candidate molecules. Phages displaying polypeptide 
candidate molecules that interact with the immobilized nucleic acids adhere to the solid 
• support, and phage nucleic acids corresponding to the adhered phages then are isolated and 
sequenced to determine the sequence of the polypeptide test molecules that interacted with the 
immobilized nucleic acids. Methods for displaying a wide variety of peptides or proteins as 
fusions with bacteriophage coat proteins are known (Scott and Smith, Science 249:386-390 
(1990); Devlin, Science 249:404-406 (1990); Cwirla, etal, Proc. Natl Acad. Scl 87:6378- 
6382 (1990); Felici, J. Mol. Biol 222:301-310 (1991); U.S. Patent Nos. 5,096,815 and 
5,198,346; U.S. Patent Nos. 5,223,409; 5,403,484; 5,571,698; and 5,766,905). Methods also 
are available for linking the test polypeptide to the N-terminus or the C-terminus of the phage 
coat protein. 

[0050] A signal generated by the system when a candidate molecule binds to a nucleic acid 
and/or a nucleic acid binding protein often scales directly with a range of increasing nucleic 
acid, nucleic acid binding protein, or candidate molecule concentrations. Signal intensity often 
exhibits a hyperbolic relationship when plotted as a function of nucleic acid, candidate 
molecule, or nucleic acid binding protein concentrations. The signal sometimes is increased 
relative to background signal levels when a candidate molecule binds to a nucleic acid and/or a 
nucleic acid binding protein, and sometimes the signal decreases relative to background signal 
levels under such circumstances. The candidate molecules often interact with the nucleic acid 
and/or nucleic acid binding protein by reversible binding, and sometimes interact with 
irreversible binding. For example, the candidate molecule may reversibly form a covalent 
bond between a portion of the candidate molecule and an amino acid side chain in the protein 
(e.g., a lysine), depending on the chemical structure of the candidate molecule. 

[0051] Candidate molecules often are identified as interacting with the nucleic acid and/or 
nucleic acid binding protein when the signal produced in a system containing the candidate 
molecule is different than the signal produced in a system not containing the candidate 
molecule. While background signals may be assessed each time a new candidate molecule, 
nucleic acid, or nucleic acid binding protein is probed by the assay, detecting the background 
signal often is not required each time a new test molecule or test nucleic acid is assayed. 
Control assays also can be performed to determine background signals and to rule out false 
positive results and false negative results. Such control assays often do not comprise one or 
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more assay components included in other assays (e.g., a control assay sample sometimes does 
not include a candidate molecule, a nucleic acid, or a protein that interacts with the nucleic 
acid). 

[0052] In addition to determining whether a candidate molecule gives rise to a different 
signal, the affinity of the interaction between the candidate molecule with the nucleic acid 
and/or nucleic acid binding protein sometimes is quantified. ICso, Kd, or Kj threshold values 
sometimes are compared to the measured IC50 or K<j values for each interaction, and thereby 
are used to identify a candidate molecule that interacts with the nucleic acid or nucleic acid 
binding protein and modulates the biological activity. For example, IC50 or Kd threshold 
values of 10 (oM or less, 1 jaM or less, and 100 nM or less often are utilized, and sometimes 
threshold values of 10 nM or less, 1 nM or less, 100 pM or less, and 10 pM or less are utilized 
to identify candidate molecules that interact with nucleic acids and/or binding proteins and 
modulate the biological activity. 

[0053] Specific assays sometimes are utilized to identify candidate molecules that 
modulate the biological activity of a nucleic acid and/or binding protein complex. For 
example, fluorescence assays, gel mobility shift assays (see, e.g., Jin & Pike, Mol. Endocrinol. 
10:196-205 (1996) and Postel, J. Biol Chem. 274:22821-22829 (1999)), polymerase arrest 
assays, transcription reporter assays, DNA cleavage assays, protein binding and apoptosis 
assays (see, e.g., Amersham Biosciences (Piscataway, New Jersey)) sometimes are utilized. 
Also, topoisomerase assays sometimes are utilized subsequently to determine whether the 
quadruplex interacting molecules have a topoisomerase pathway activity (see, e.g., TopoGEN, 
Inc. (Columbus, Ohio)). 

[0054] A fluorescence interaction assay is useful for identifying candidate molecules that 
interact with DNA capable of forming a quadruplex structure. In particular, such assays are 
useful in in vitro high-throughput assays and in gel electrophoretic mobility shift assays. Such 
methods sometimes comprise contacting a sample comprising a nucleic acid with a candidate 
molecule, where the nucleic acid includes or consists of nucleotide sequence that is identical or 
substantially similar to a native nucleotide sequence capable of forming a G-quadruplex 
structure. One or more nucleoside moieties in the native nucleotide sequence sometimes are 
substituted with a fluorescent nucleoside analog. Examples of such fluorescent nucleoside 
analogs are 2-amino purine (e.g., 2-amino adenosine), pyrrolo-C, 6-MAP, and furano-dT (for 
other examples, see http address www.glenresearch.com/GlenReports/GR15-13.html). In 
specific embodiments, one or more nucleoside moieties in a loop region of an DP structure in 
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Figure 1 are substituted with a fluorescent nucleoside analog such as 2-amino purine or 
pyrrolo-C. A fluorescent signal generated by the sample is detected after the samplers 
contacted by the candidate molecule, and the test molecule is identified as a candidate 
molecule that interacts with the nucleic acid when the fluorescent signal detected before the 
sample is contacted with the test molecule differs from the fluorescent signal detected after the 
sample is contacted with the test molecule. Fluctuations sometimes are reduced fluorescence 
intensity at a particular wavelength, and sometimes are shifts in the wavelengths at which 
fluorescence is detected. Often, the labeled strand is hybridized with a complementary strand 
and any fluctuations in fluorescence are detected upon hybridization, and the labeled hybrid 
then is contacted with test molecules and fluctuations in fluorescence are detected to determine 
which of the test molecules interact with the labeled nucleic acid. In certain embodiments, the 
sample is contacted with a nucleic acid binding protein such as NM23-H2, Spl, CNBP and/or 
hnRNPK before, at the same time, or after the sample is contacted with the test molecule. In 
other embodiments, the labeled nucleic acid is interacted with test molecules or proteins and 
the reaction products then are subjected to a gel electrophoretic mobility shift assay. 

[0055] Another example of a fluorescence interaction assay is a system that includes a 
nucleic acid, a signal molecule, and a candidate or test molecule. The signal molecule 
generates a fluorescent signal when bound to the nucleic acid {e.g., N-methylmesoporphyrin 
IX (NMM)), and the signal is altered when a candidate compound competes with the signal 
molecule for binding to the nucleic acid. An alteration in the signal when a candidate molecule 
is present as compared to when the candidate molecule is not present identifies the candidate 
molecule as a nucleic acid-interacting molecule. 50 \il of nucleic acid is added in 96-well 
plate. A candidate molecule also is added in varying concentrations. A typical assay is carried 
out in 100 \xl of 20 mM HEPES buffer, pH 7.0, 140 mM NaCl, and 100 mM KC1. 50 |al of the 
signal molecule NMM then is added for a final concentration of 3 jiM. NMM is obtained from 
Frontier Scientific Inc, Logan, Utah. Fluorescence is measured at an excitation wavelength of 
420 nm and an emission wavelength of 660 nm using a FluroStar 2000 fluorometer (BMG 
Labtechnologies, Durham, NC). Fluorescence often is plotted as a function of concentration of 
the candidate molecule or nucleic acid and maximum fluorescent signals for NMM are 
assessed in the absence of these molecules. 

[0056] DNA cleavage assays are useful for determining at which sites of a nucleic acid a 
nucleic acid binding protein interacts, for example. DNA cleavage assays have been reported 
(e.g., Postel, J. Biol Chem., 274:22821-22829 (1999)). In general, a detectable label is 
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incorporated at one portion of the nucleic acid and the label is separated from another portion 
of the nucleic acid having no detectable label or a different detectable label upon cleavage. 
Examples of detectable labels are known, such as fluorophores (e.g., Anantha, et al, 
Biochemistry 37:2709-2714 (1998) and Qu & Chaires, Methods Enzymol 321:353-369 
(2000)), fluorescent nucleotide analogs described above, NMR spectral shifts {see, e.g., 
Arthanari & Bolton, Anti-Cancer Drug Design 14:317-326 (1999)), fluorescence resonance 
energy transfers (see, e.g., Simonsson & SjSback, J. Biol. Chem. 274:17379-17383 (1999)), a 
radioactive isotope (e.g., 125 1, 131 1, 35 S, 32 P, 14 C or 3 H); a light scattering label (see, e.g., U.S. 
Patent No. 6,214,560; Genicon Sciences Corporation, San Diego, CA); an enzymic or protein 
label (e.g., green fluorescent protein (GFP) or peroxidase), or another chromogenic label or 
dye. The nucleic acid also can be linked to two fluorophores for a fluorescence resonance 
energy transfer (FRET) assay, where one fluorophore emits light at a wavelength at which the 
other fluorophore is excited, where such fluorescence energy transfer occurs when the nucleic 
acid is intact and does not occur when the nucleic acid is cleaved by a nucleic acid binding 
protein. Similarly, a candidate molecule linked to a nucleic acid binding protein can be 
detected by detecting the candidate molecule bound to the protein or a detectable label bound 
to a candidate molecule linked to a binding protein. 

[0057] A gel electrophoretic mobility shift assay (EMSA) is useful for dete rmin ing 
whether a nucleic acid forms a quadruplex and whether a nucleotide sequence is quadruplex- 
destabilizing. EMSA is conducted as described previously (Jin & Pike, Mol Endocrinol. 
10:196-205 (1996)) with minor modifications. Synthetic single-stranded oligonucleotides are 
labeled in the 5' terminus with T4-kinase in the presence of [y- 32 P] ATP (1,000 mCi/mmol, 
Amersham Life Science) and purified through a sephadex column. 32 P-labeled 
oligonucleotides (-30,000 cpm) then are incubated with or without various concentrations of a 
testing compound in 20 pi of a buffer containing 10 mM Tris pH 7.5, 100 mM KC1, 5 mM 
dithiothreitol, 0.1 mM EDTA, 5 mM MgCl 2 , 10% glycerol, 0.05% Nonedit P-40, and 0.1 
mg/ml of poly(dI-dC) (Pharmacia). After incubation for 20 minutes at room temperature, 
binding reactions are loaded on a 5% polyacrylamide gel in 0.25 x Tris borate-EDTA buffer 
(0.25 x TBE, 1 x TBE is 89 mM Tris-borate, pH 8.0, 1 mM EDTA). The gel is dried and each 
band is quantified using a phosphorimager. 

[0058] Another example of an EMSA assay is performed as follows. Ten microliter 
reactions are assembled in Reaction Buffer (50 mM Tris-HCl, pH 7.9, 0.5 mM dithiothreitol, 
and 50 mg/ml bovine serum albumin). MgCl 2 , KC1, EDTA, protease K, and ATP are added. 
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Radiolabeled DNA or fluorescently labeled DNA (described above) andNM23-H2 in storage 
buffer (20 mM Hepes, pH 7.9, 5 mM MgCl 2 , 0.1 mM EDTA, 0.1 M KC1, 1 mM dithiothreitol, 
20% glycerol, and protease inhibitors (Postel, etal, Mol. Cell. Biol. 9:5123-5133 (1989)) are 
added last, and the reactions are incubated for 15 minutes at room temperature. To separate the 
protein-DNA complexes, the reactions are loaded onto 5% native polyacrylamide gels and 
electrophoresed in 0.53 TBE buffer (45 mM Tris borate, pH 8.3, 1.25 mM EDTA) at room 
temperature for 30 minutes at 100 V. Gels are vacuum-dried and exposed onto XAR (Eastman 
Kodak Co.) film. 

[0059] Chemical footprinting assays are useful for assessing quadruplex structure. 
Quadruplex structure is assessed by determining which nucleotides in a nucleic acid is 
protected or unprotected from chemical modification as a result of being inaccessible or 
accessible, respectively, to the modifying reagent. A DMS methylation assay is an example of 
a chemical footprinting assay. In such an assay, bands from EMSA are isolated and subjected 
to DMS-induced strand cleavage. Each band of interest is excised from an electrophoretic 
mobility shift gel and soaked in 100 mM KC1 solution (300 ul) for 6 hours at 4°C. The 
solutions are filtered (microcentrifuge) and 30,000 cpm (per reaction) of DNA solution is 
diluted further with 100 mM KC1 in 0.1X TE to a total volume of 70 ul (per reaction). 
Following the addition of 1 ul salmon sperm DNA (0.1 ug/ul), the reaction mixture is 
incubated with 1 ul DMS solution (DMS:ethanol; 4: 1; v:v) for a period of time. Each reaction 
is quenched with 18 ul of stop buffer (b -mercaptoathanol:water:NaOAc (3 M); 1:6:7; v:v:v). 
Following ethanol precipitation (twice) and piperidine cleavage, the reactions are separated on 
a preparative gel (16%) and visualized on a phosphorimager. 

[0060] A polymerase arrest assay is useful for determining whether a candidate molecule 
stabilizes a quadruplex structure. Such an assay includes a template nucleic acid, which often 
comprises a quadruplex forming sequence, and a primer nucleic acid that hybridizes to the 
template nucleic acid 5' of the quadmplex-forming sequence. The primer is extended by a 
polymerase (e.g., Taq polymerase), which advances from the primer along the template nucleic 
acid. In this assay, a quadruplex structure can block or arrest the advance of the enzyme, 
leading to shorter transcription fragments. Also, the arrest assay may be conducted at a variety 
of temperatures, including 45°C and 60°C, and at a variety of ion concentrations. An example 
of a Taq polymerase stop assay is described in Han, et al, Nucl. Acids Res. 27:537-542 (1999), 
which is a modification of that used by Weitzmann, et ah, J. Biol. Chem. 271, 20958-20964 
(1996). Briefly, a reaction mixture of template DNA (50 nM), Tris»HCl (50 mM), MgCl 2 (10 
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mM), DTT (0.5 mM), EDTA (0.1 mM), BSA (60 ng), and 5 '-end-labeled quadruplex nucleic 
acid (-18 nM) is heated to 90°C for 5 minutes and allowed to cool to ambient temperature over 
30 minutes. Taq Polymerase (1 is added to the reaction mixture, and the reaction is 
maintained at a constant temperature for 30 minutes. Following the addition of 10 \il stop 
buffer (formamide (20 ml), 1 M NaOH (200 |il), 0.5 M EDTA (400 ^1), and 10 mg 
bromophenol blue), the reactions are separated on a preparative gel (12%) and visualized on a 
phosphorimager. Adenine sequencing (indicated by "A" at the top of the gel) is performed 
using double-stranded DNA Cycle Sequencing System from Life Technologies. The general 
sequence for the template strands is TCCAACTATGTATAC-INSERT- 
TTAGCGACACGCAATTGCTATAGTGAGTCGTATTA. Bands on the gel that exhibit 
slower mobility are indicative of quadruplex formation. 

[0061] In another example of a polymerase arrest assay, a 5'- fluorescent-labeled (FAM) 
primer (P45, 15 nM) is mixed with template DNA (15nM) in a Tris-HCL buffer (15 mM Tris, 
pH 7.5) containing lOmM MgCl 2 , O.lmM EDTA and O.lmM mixed deoxynucleotide 
triphosphates (dNTP's). The FAM-P45 primer (5'- 6FAM- 

AGTCTGACTGACTGTACGTAGCTAATACGACTCACTATAGCAATT-3 , ) and the 
template DNA (5 '-TCCAACTATGTATACTGGGGA 
GGGTGGGGAGGGTGGGGAAGGTT AGCGACACGCAATTGCTATAG 
TGAGTCGTATTAGCTACGTACAGTCAGTCAGACT-3') are synthesized andHPLC 
purified by Applied Biosystems. The mixture is denatured at 95°C for 5 minutes and, after 
cooling down to room temperature, is incubated at 37°C for 15 minutes. After cooling down to 
room temperature, ImM KC1 2 and the test compound (various concentrations) are added and 
the mixture incubated for 15 minutes at room temperature. The primer extension is performed 
by adding lOmM KC1 and Taq DNA Polymerase (2.5 U/reaction, Promega) and incubating at 
70°C for 30 minutes. The reaction is stopped by adding 1 nl of the reaction mixture to 10 jal 
Hi-Di Formamide mixed and 0.25 jil LIZ120 size standard. Hi-Di Formamide and LIZ120 
size standard are purchased from Applied Biosystems. The partially extended quadruplex 
arrest product is between 61 or 62 bases long and the full-length extended product is 99 bases 
long. The products are separated and analyzed using capillary electrophoresis. Capillary 
electrophoresis is performed using an ABI PRISM 3100-Avant Genetic Analyzer. 

[0062] Certain arrest assays are performed in cells. In a transcription reporter assay, test 
quadruplex DNA is coupled to a reporter system, such that a formation or stabilization of a 
quadruplex structure can modulate a reporter signal. An example of such a system is a reporter 
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expression system in which a polypeptide, such as luciferase or green fluorescent protein 
(GFP), is expressed by a gene operably linked to the potential quadruplex forming nucleic acid 
and expression of the polypeptide can be detected. As used herein, the term "operably linked" 
refers to a nucleotide sequence which is regulated by a sequence comprising the potential 
quadruplex forming nucleic acid. A sequence may be operably linked when it is on the same 
nucleic acid as the quadruplex DNA, or on a different nucleic acid. An exemplary luciferase 
reporter system is described herein. A luciferase promoter assay described in He, et ah, 
Science 281:1509-1512 (1998) often is utilized for the study of quadruplex formation. 
Specifically, a vector utilized for the assay is set forth in reference 1 1 of the He, et al, 
document. In this assay, HeLa cells are transfected using the lipofectamin 2000-based system 
(Invitrogen) according to the manufacturer's protocol, using 0.1 jig of pRL-TK (Renilla 
luciferase reporter plasmid) and 0.9 \xg of the quadruplex-forming plasmid. Firefly and Renilla 
luciferase activities are assayed using the Dual Luciferase Reporter Assay System (Promega) 
in a 96-well plate format according to the manufacturer's protocol. 

[0063] Circular dichroism (CD) sometimes is utilized to determine whether a candidate 
molecule interacts with a quadruplex nucleic acid. CD is particularly useful for determining 
whether a candidate molecule interacts with a nucleic acid in vitro. In certain embodiments, a 
candidate molecule is added to a DNA sample (5 \xM each) in a buffer containing 10 mM 
potassium phosphate (pH 7.2) and 10 or 250 mM KC1 at 37°C and then allowed to stand for 5 
min at the same temperature before recording spectra. CD spectra are recorded on a Jasco J- 
715 spectropolarimeter equipped with a thermoelectrically controlled single cell holder. CD 
intensity normally is detected between 220 nm and 320 nm and comparative spectra for DNA 
alone, candidate molecule alone, and the DNA with the candidate molecule are generated to 
determine the presence or absence of an interaction (see e.g. Datta et al. 9 JACS 123:9612-9619 
(2001)). Spectra are arranged to represent the average of eight scans recorded at 100 nm/min. 
In certain embodiments, CD signals are monitored as a function of temperature. For example, 
CD signals indicative of the presence of a quadruplex structure in a sample can be determined 
and the loss or shift of those signals can be followed as a function of increasing temperature to 
determine at which temperature the quadruplex structure melts. The signals often are 
monitored in the presence of test molecule and in the absence of test molecule, and stabilizing 
molecules that increase the melting temperature of the quadruplex structure are identified as 
candidate molecules. 
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[0064] A cell proliferation assay is useful for assessing the utility of a candidate molecule 
for treating a cell proliferative disorder in a subject. In a cancer cell proliferation assay, cell 
proliferation rates are assessed as a function of different concentrations of test compounds 
added to the cell culture medium. Any cancer cell type can be utilized in the assay. In one 
embodiment, colon cancer cells are cultured in vitro and test compounds are added to the 
culture medium at varying concentrations. A useful colon cancer cell line is colo320, which is 
a colon adenocarcinoma cell line deposited with the National Institutes of Health as accession 
number JCRB0225. Parameters for using such cells are available at the http address 
cellbank.nihs.go.jp/cell/data/jcrb0225 .htm. 

Utilization of Candidate Molecules as Therapeutics 

[0065] Because quadruplexes are regulators of biological processes such as oncogene 
transcription, modulators of quadruplex biological activity can be utilized as cancer 
therapeutics. For example, molecules that stabilize quadruplex structures can exert a 
therapeutic effect for certain cell proliferative disorders and related conditions because 
quadruplex structures typically down-regulate the oncogene expression which can cause cell 
proliferative disorders. Quadraplex-interacting candidate molecules can exert a biological 
effect according to different mechanisms, which include, for example, stabilizing a native 
quadruplex structure, inhibiting conversion of a native quadruplex to duplex DNA, and 
stabilizing a native quadruplex structure having a quadruplex-destabilizing nucleotide 
substitution. Thus, quadruplex interacting candidate molecules described herein may be 
administered to cells, tissues, or organisms, thereby down-regulating oncogene transcription 
and treating cell proliferative disorders. The terms "treating," 'treatment" and 'therapeutic 
effect" as used herein refer to reducing or stopping a cell proliferation rate (e.g., slowing or 
halting tumor growth) or reducing the number of proliferating cancer cells (e.g., removing part 
or all of a tumor) and refers to alleviating, completely or in part, a cell proliferation condition. 

[0066] Quadruplex interacting molecules and quadruplex forming nucleic acids can be 
utilized to target a cell proliferative disorder. Cell proliferative disorders include, for example, 
colorectal cancers. Other examples of cancers include hematopoietic neoplastic disorders, 
which are diseases involving hyperplastic/neoplastic cells of hematopoietic origin (e.g., arising 
from myeloid, lymphoid or erythroid lineages, or precursor cells thereof). The diseases can 
arise from poorly differentiated acute leukemias, e.g., erythroblastic leukemia and acute 
megakaryoblastic leukemia. Additional myeloid disorders include, but are not limited to, acute 
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promyeloid leukemia (APML), acute myelogenous leukemia (AML) and chronic myelogenous 
leukemia (CML) (reviewed in Vaickus, Crit. Rev. in Oncol/Hemotol 1 1:267-297 (1991)); 
lymphoid malignancies include, but are not limited to acute lymphoblastic leukemia (ALL), 
which includes B-lineage ALL and T-lineage ALL, chronic lymphocytic leukemia (CLL), 
prolymphocyte leukemia (PLL), hairy cell leukemia (HLL) and Waldenstrom ! s 
macroglobulinemia (WM). Additional forms of malignant lymphomas include, but are not 
limited to non-Hodgkin lymphoma and variants thereof, peripheral T cell lymphomas, adult T 
cell leukemia/lymphoma (ATL), cutaneous T-cell lymphoma (CTCL), large granular 
lymphocytic leukemia (LGF), Hodgkin's disease and Reed-Sternberg disease. Cell 
proliferative disorders also include cancers of the colorectum, breast, lung, liver, pancreas, 
lymph node, colon, prostate, brain, head and neck, skin, liver, kidney, and heart. Candidate 
molecules also can be utilized to target cancer related processes and conditions, such as 
increased angiogenesis, by inhibiting angiogenesis in a subject (e.g., molecules that stabilize a 
VEGF associated quadruplex structure can inhibit angiogenesis). 

[0067] Thus, provided herein are methods for reducing cell proliferation or for treating or 
alleviating cell proliferative disorders, which comprise contacting a system having a nucleic 
acid comprising a native quadruplex with a candidate molecule identified herein. The system 
sometimes is a group of cells or one or more tissues, and often is a subject in need of a 
treatment of a cell proliferative disorder. A subject often is a mammal such as a mouse, rat, 
monkey, or human. One embodiment is a method for treating colorectal cancer by 
administering a candidate molecule that interacts with a CMYC regulatory nucleotide sequence 
to a subject in need thereof, thereby reducing the colorectal cancer cell proliferation. Another 
embodiment is a method for inhibiting angiogenesis and optionally treating a cancer associated 
with angiogenesis, which comprises administering a candidate molecule that interacts with a 
VEGF regulatory nucleotide sequence to a subject in need thereof, thereby reducing 
angiogenesis and optionally treating a cancer associated with angiogenesis. In another 
embodiment, a candidate molecule that interacts with a HER2/neu G-quadruplex is 
administered to a subject for the treatment of a fat cell proliferative disorder such as obesity. 

[0068] Retroviruses offer a wealth of potential targets for G-quadruplex targeted 
therapeutics. G-quadruplex structures have been implicated as functional elements in at least 
two critical secondary structures formed by either viral RNA or DNA in HIV, the dimer linker 
structure (DLS) and the central DNA flap (CDF). Additionally, DNA aptamers which are able 
to adopt either inter- or intramolecular quadruplex structures are able to inhibit viral 
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replication, by targeting either the envelope glycoprotein (putatively) or HlV-integrase 
respectively. Although not direct evidence, the latter observation indicates an involvement of 
native quadruplex structures in interaction with the integrase enzyme. 

[0069] Dimer linker structures, which are common to all retroviruses, serve to bind two 
copies of the viral genome together by a non-covalent interaction between the two 5' ends of 
the two viral RNA sequences. The genomic dimer is stably associated with the gag protein in 
the mature virus particle. In the case of HTV, the origin of this non-covalent binding, may be 
traced to a 98 base-pair sequence containing several runs of at least two consecutive guanines, 
the 3'-most of which is critical for the formation of RNA dimers in vitro. An observed cation 
(potassium) dependence for the formation and stability of the dimer in vitro, in addition to the 
failure of an antisense sequence to effectively dimerize, has revealed the most likely binding 
structure to be an intermolecular G-quadruplex. 

[0070] Prior to integration into the host genome, reverse transcribed viral DNA forms a 
pre-integration complex (PIC) with at least two major viral proteins, integrase and reverse 
transcriptase, which is subsequently transported into the nucleus by an as yet undefined 
mechanism. The Central DNA Flap (CDF) refers to 99-base length single-stranded tail of the 
+ strand, occurring near the center of the viral duplex DNA, which is known to a play a role in 
the nuclear import of the PIC. Oligonucleotide mimics of the CDF have been shown to form 
intermolecular G-quadruplex structures in cell-free systems. 

[0071] Thus, candidate molecules can be used to stabilize the DLS and thus prevent de- 
coupling of the two RNA strands, an event which is necessary for viral replication. Also, by 
binding to the quadruplex structure formed by the CDF, critical protein recognition and/or 
binding events necessary for nuclear transport of the PIC may be disrupted. In either case, a 
substantial advantage can exist over other anti-viral therapeutics. Current Highly Active Anti- 
Retroviral Therapeutic (HAART) regimes rely on the use of combinations of drugs targeted 
towards the HTV protease and HTV integrase. The requirement for multi-drug regimes is to 
minimize the emergence of resistance, which will usually develop rapidly when agents are 
used in isolation. The source of such rapid resistance is the infidelity of the reverse 
transcriptase enzyme which makes a mutation approx. once in every 10,000 base pairs. An 
advantage of targeting critical viral quadruplex structures over protein targets, is that the 
development of resistance is slow or is impossible. A point mutation of the target quadruplex, 
necessary to reduce affinity for the candidate molecule, can compromise the integrity of the 
critical quadruplex structure and lead to a non-functional copy of the virus. A single 
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therapeutic agent based on this concept may replace the multiple drug regimes currently 
employed, with the concomitant benefits of reduced costs and the elimination of harmful 
drug/drug interactions. 

[0072] Thus, provided herein are methods for inhibiting viral propagation in a system, 
which comprise contacting a system having a viral quadruplex-forming nucleic acid sequence 
with a candidate molecule described herein. The system sometimes is a group of cells or one . 
or more tissues, and often is a subject in need of a treatment of a viral infection (e.g. , a 
mammal such as a mouse, rat, monkey, or human). In an embodiment, provided is a method 
for treating HIV infection by administering a candidate molecule identified herein to a subject 
in need thereof, thereby reducing the HIV titers in the systems and alleviating infection. 

[0073] Any suitable formulation of the candidate molecules described herein can be 
prepared for administration. Any suitable route of administration may be used, including but 
not limited to oral, parenteral, intravenous, intramuscular, topical and subcutaneous routes. 

[0074] In cases where candidate molecules are sufficiently basic or acidic to form stable 
nontoxic acid or base salts, administration of the candidate molecules as salts may be 
appropriate. Examples of pharmaceutically acceptable salts are organic acid addition salts 
formed with acids that form a physiological acceptable anion, for example, tosylate, 
methanesulfonate, acetate, citrate, malonate, tartarate, succinate, benzoate, ascorbate, a- 
ketoglutarate, and a-glycerophosphate. Suitable inorganic salts may also be formed, including 
hydrochloride, sulfate, nitrate, bicarbonate, and carbonate salts. Pharmaceutically acceptable 
salts are obtained using standard procedures well known in the art, for example by reacting a 
sufficiently basic candidate molecule such as an amine with a suitable acid affording a 
physiologically acceptable anion. Alkali metal (e.g., sodium, potassium or lithium) or alkaline 
earth metal (e.g., calcium) salts of carboxylic acids also are made. 

[0075] In one embodiment, a candidate molecule is administered systemically (e.g., orally) 
in combination with a pharmaceutically acceptable vehicle such as an inert diluent or an 
assimilable edible carrier. They may be enclosed in hard or soft shell gelatin capsules, 
compressed into tablets, or incorporated directly with the food of the patient's diet. For oral 
therapeutic administration, the active candidate molecule may be combined with one or more 
excipients and used in the form of ingestible tablets, buccal tablets, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like. Such compositions and preparations should contain 
at least 0.1% of active candidate molecule. The percentage of the compositions and 
preparations may be varied and may conveniently be between about 2 to about 60% of the 
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weight of a given unit dosage form. The amount of active candidate molecule in such 
therapeutically useful compositions is such that an effective dosage level will be obtained. 

[0076] Tablets, troches, pills, capsules, and the like also may contain the following: 
binders such as gum tragacanth, acacia, com starch or gelatin; excipients such as dicalcium 
phosphate; a disintegrating agent such as com starch, potato starch, alginic acid and the like; a 
lubricant such as magnesium stearate; and a sweetening agent such as sucrose, fructose, lactose 
or aspartame or a flavoring agent such as peppermint, oil of wintergreen, or cherry flavoring 
may be added. When the unit dosage form is a capsule, it may contain, in addition to materials 
of the above type, a liquid carrier, such as a vegetable oil or a polyethylene glycol. Various 
other materials may be present as coatings or to otherwise modify the physical form of the 
solid unit dosage form. For instance, tablets, pills, or capsules may be coated with gelatin, 
wax, shellac or sugar and the like. A syrup or elixir may contain the active candidate 
molecule, sucrose or fructose as a sweetening agent, methyl and propylparabens as 
preservatives, a dye and flavoring such as cherry or orange flavor. Any material used in 
preparing any unit dosage form is pharmaceutical^ acceptable and substantially non-toxic in 
the amounts employed. In addition, the active candidate molecule may be incorporated into 
sustained-release preparations and devices. 

[0077] The active candidate molecule also may be administered intravenously or 
intraperitoneally by infusion or injection. Solutions of the active candidate molecule or its 
salts may be prepared in a buffered solution, often phosphate buffered saline, optionally mixed 
with a nontoxic surfactant. Dispersions can also be prepared in glycerol, liquid polyethylene 
glycols, triacetin, and mixtures thereof and in oils. Under ordinary conditions of storage and 
use, these preparations contain a preservative to prevent the growth of microorganisms. The 
candidate molecule is sometimes prepared as a polymatrix-containing formulation for such 
administration {e.g., a liposome or microsome). Liposomes are described for example in U.S. 
Patent No. 5,703,055 (Feigner, et al) and Gregoriadis, Liposome Technology vols. I to III 
(2nd ed. 1993). 

[0078] The pharmaceutical dosage forms suitable for injection or infusion can include 
sterile aqueous solutions or dispersions or sterile powders comprising the active ingredient that 
are adapted for the extemporaneous preparation of sterile injectable or infusible solutions or 
dispersions, optionally encapsulated in liposomes. In all cases, the ultimate dosage form 
should be sterile, fluid and stable under the conditions of manufacture and storage. The liquid 
carrier or vehicle can be a solvent or liquid dispersion medium comprising, for example, water, 
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ethanol, a polyol (for example, glycerol, propylene glycol, liquid polyethylene glycols, and the 
like), vegetable oils, nontoxic glyceryl esters, and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the formation of liposomes, by the maintenance of 
the required particle size in the case of dispersions or by the use of surfactants. The prevention 
of the action of microorganisms can be brought about by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In 
many cases, it will be preferable to include isotonic agents, for example, sugars, buffers or 
sodium chloride. Prolonged absorption of the injectable compositions can be brought about by 
the use in the compositions of agents delaying absorption, for example, aluminum 
monostearate and gelatin. 

[0079] Sterile injectable solutions are prepared by incorporating the active candidate 
molecule in the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filter sterilization. In the case of sterile powders 
for the preparation of sterile injectable solutions, the preferred methods of preparation are 
vacuum drying and the freeze drying techniques, which yield a powder of the active ingredient 
plus any additional desired ingredient present in the previously sterile-filtered solutions. 

[0080] For topical administration, the present candidate molecules may be applied in liquid 
form. Candidate molecules often are a&ninistered as compositions or formulations, in 
combination with a dermatologically acceptable carrier, which may be a solid or a liquid. 
Examples of useful dermatological compositions used to deliver candidate molecules to the 
skin are known (see, e.g., Jacquet, et ah (U.S. Pat. No. 4,608,392), Geria (U.S. Pat. No. 
4,992,478), Smith, et al. (U.S. Pat. No. 4,559,157) and Wortzman (U.S. Pat. No. 4,820,508). 

[0081] Candidate molecules may be formulated with a solid carrier, which include finely 
divided solids such as talc, clay, microcrystalline cellulose, silica, alumina and the like. Useful 
liquid carriers include water, alcohols or glycols or water-alcohol/glycol blends, in which the 
present candidate molecules can be dissolved or dispersed at effective levels, optionally with 
the aid of non-toxic surfactants. Adjuvants such as fragrances and additional antimicrobial 
agents can be added to optimize the properties-for-a given use. The resultant liquid 
compositions can be applied from absorbent pads, used to impregnate bandages and other 
dressings, or sprayed onto the affected area using pump-type or aerosol sprayers. Thickeners 
such as synthetic polymers, fatty acids, fatty acid salts and esters, fatty alcohols, modified 
celluloses or modified mineral materials can also be employed with liquid carriers to form 
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spreadable pastes, gels, ointments, soaps, and the like, for application directly to the skin of the 
user. 

[0082] Generally, the concentration of the candidate molecule in a liquid composition often 
is from about 0.1 wt% to about 25 wt%, sometimes from about 0.5 wt% to about 10 wt%. The 
concentration in a semi-solid or solid composition such as a gel or a powder often is about 0.1 
wt% to about 5 wt%, sometimes about 0.5 wt% to about 2.5 wt%. A candidate molecule 
composition may be prepared as a unit dosage form, which is prepared according to • 
conventional techniques known in the pharmaceutical industry. In general terms, such 
techniques include bringing a candidate molecule into association with pharmaceutical 
carriers) and/or excipient(s) in liquid form or finely divided solid form, or both, and then 
shaping the product if required. The candidate molecule composition may be formulated into 
any dosage form, such as tablets, capsules, gel capsules, liquid syrups, soft gels, suppositories, 
and enemas. The compositions also may be formulated as suspensions in aqueous, non- 
aqueous, or mixed media. Aqueous suspensions may further contain substances which 
increase viscosity, including for example, sodium carboxymethylcellulose, sorbitol, and/or 
dextran. The suspension may also contain one or more stabilizers. 

[0083] The amount of the candidate molecule, or an active salt or derivative thereof, 
required for use in treatment will vary not only with the particular salt selected but also with 
the route of adininistration, the nature of the condition being treated and the age and condition 
of the patient and will be ultimately at the discretion of the attendant physician or clinician. 

[0084] A useful candidate molecule dosage often is determined by assessing its in vitro 
activity in a cell or tissue system and/or in vivo activity in an animal system. For example, 
methods for extrapolating an effective dosage in mice and other animals to humans are known 
to the art (see, e.g., U.S. Pat. No. 4,938,949). Such systems can be used for determining the 
LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically effective 
in 50% of the population) of a candidate molecule. The dose ratio between a toxic and 
therapeutic effect is the therapeutic index and it can be expressed as the ratio ED50/LD50. The 
candidate molecule dosage often lies within a range of circulating concentrations for which the 
ED 50 is associated with little or no toxicity. The dosage may vary within this range depending 
upon the dosage form employed and the route of administration utilized. For any candidate 
molecules used in the methods described herein, the therapeutically effective dose can be 
estimated initially from cell culture assays. A dose sometimes is formulated to achieve a 
circulating plasma concentration range covering the IC50 (j e., the concentration of the test 
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candidate molecule which achieves a half-maximal inhibition of symptoms) as determined in 
in vitro assays, as such information often is used to more accurately determine useful doses in 
humans. Levels in plasma may be measured, for example, by high performance liquid 
chromatography. 

[0085] Another example of effective dose determination for a subject is the ability to 
directly assay levels of "free" and "bound" candidate molecule in the serum of the test subject. 
Such assays may utilize antibody mimics and/or "biosensors" generated by molecular 
imprinting techniques. The candidate molecule is used as a template, or "imprinting 
molecule", to spatially organize polymerizable monomers prior to their polymerization with 
catalytic reagents. Subsequent removal of the imprinted molecule leaves a polymer matrix 
which contains a repeated "negative image" of the candidate molecule and is able to selectively 
rebind the molecule under biological assay conditions (see, e.g., Ansell, et al., Current Opinion 
in Biotechnology 7: 89-94 (1996) and in Shea, Trends in Polymer Science 2: 166-173 (1994)). 
Such "imprinted" affinity matrixes are amenable to ligand-binding assays, whereby the 
immobilized monoclonal antibody component is replaced by an appropriately imprinted matrix 
(see, e.g., Vlatakis, et al., Nature 361: 645-647 (1993)). Through the use of isotope-labeling, 
"free" concentration of candidate molecule can be readily monitored and used in calculations 
of IC 50 . Such "imprinted" affinity matrixes can also be designed to include fluorescent groups 
whose photon-emitting properties measurably change upon local and selective binding of 
candidate molecule. These changes can be readily assayed in real time using appropriate fiber 
optic devices, in turn allowing the dose in a test subject to be quickly optimized based on its 
individual IC 50 . An example of such a "biosensor" is discussed in Kriz, et al, Analytical 
Chemistry 67: 2142-2144 (1995). 

[0086] Exemplary doses include milligram or microgram amounts of the candidate 
molecule per kilogram of subject or sample weight, for example, about 1 microgram per 
kilogram to about 500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 
milligrams per kilogram, or about 1 microgram per kilogram to about 50 micrograms per 
kilogram. It is understood that appropriate doses of a small molecule depend upon the potency 
of the small molecule with respect to the expression or activity to be modulated. When one or 
more of these small molecules is to be administered to an animal (e.g., a human) in order to 
modulate expression or activity of a polypeptide or nucleic acid described herein, a physician, 
veterinarian, or researcher may, for example, prescribe a relatively low dose at first, 
subsequently increasing the dose until an appropriate response is obtained. In addition, it is 



33 



WO 2005/049867 



PCT/US2004/037789 



understood that the specific dose level for any particular animal subject will depend upon a 
variety of factors including the activity of the specific candidate molecule employed, the age, 
body weight, general health, gender, and diet of the subject, the time of administration, the 
route of administration, the rate of excretion, any drug combination, and the degree of 
expression or activity to be modulated. 

[0087] Each document and publication cited is incorporated herein by reference in its 
entirety, including all figures, drawings, tables, text, and documents and publications 
referenced therein. 
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